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Abstract 

Background: Huanglongbing (HLB) is one of the most destructive citrus diseases in the world. The disease is 
associated with the presence of a fastidious, phloem-limited a- proteobacterium, 'Candidatus Liberibacter asiaticus', 
'Ca. Liberibacter africanus' or 'Ca. Liberibacter americanus'. HLB-associated Liberibacters have spread to North 
America and South America in recent years. While the causal agents of HLB have been putatively identified, 
information regarding the worldwide population structure and epidemiological relationships for 'Ca. L. asiaticus' is 
limited. The availability of the 'Ca. L. asiaticus' genome sequence has facilitated development of molecular markers 
from this bacterium. The objectives of this study were to develop microsatellite markers and conduct genetic 
analyses of 'Ca. L. asiaticus' from a worldwide collection. Two hundred eighty seven isolates from USA (Florida), 
Brazil, China, India, Cambodia, Vietnam, Taiwan, Thailand, and Japan were analyzed. 

Results: A panel of seven polymorphic microsatellite markers was developed for 'Ca. L. asiaticus'. Microsatellite 
analyses across the samples showed that the genetic diversity of 'Ca. L. asiaticus' is higher in Asia than Americas. 
UPGMA and STRUCTURE analyses identified three major genetic groups worldwide. Isolates from India were 
genetically distinct. East-southeast Asian and Brazilian isolates were generally included in the same group; a few 
members of this group were found in Florida, but the majority of the isolates from Florida were clustered 
separately. eBURST analysis predicted three founder haplotypes, which may have given rise to three groups 
worldwide. 

Conclusions: Our results identified three major genetic groups of 'Ca. L. asiaticus' worldwide. Isolates from Brazil 
showed similar genetic makeup with east-southeast Asian dominant group, suggesting the possibility of a 
common origin. However, most of the isolates recovered from Florida were clustered in a separate group. While 
the sources of the dominant 'Ca. L. asiaticus' in Florida were not clearly understood, the less-pervasive groups may 
have been introduced directly from Asia or via Brazil. Notably, the recent outbreak of HLB in Florida probably 
occurred through multiple introductions. Microsatellite markers developed in this study provide adequate 
discriminatory power for the identification and differentiation of closely-related isolates, as well as for genetic 
studies of 'Ca. L. asiaticus'. 
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Background 

Huanglongbing (HLB) is one of the most devastating 
diseases of citrus, which is characterized by the develop- 
ment of yellow shoots and stunted growth of infected 
trees combined with a decline in quantity and quality of 
fruit production [1]. HLB-affected fruit are abnormally- 
pigmented, developmentally flawed, and have a bitter 
taste- making them unusable for juice production or as 
table fruit [2,3]. Typically, trees with HLB succumb to 
the effects of infection and die within a few years after 
showing the first signs of the disease [4]. 

HLB is associated with three ' Candidatus Liberibacter' 
species worldwide: 'Ca. L. asiaticus', 'Ca. L. africanus' 
and 'Ca. L. americanus'; the nomenclature is based on 
the presumptive origin of each bacterium in Asia, Africa 
and South America, respectively [1]. HLB has been 
known in Asian countries since the 1870s [1,5,6] and 
found to be associated with the presence of a fastidious 
a-proteobacterium named 'Candidatus Liberibacter 
asiaticus'. In the western hemisphere, it was reported in 
Sao Paulo, Brazil in 2004 and in Florida, USA in 2005- 
two of the largest citrus growing regions in the world 
[1]. Although 'Ca. L. americanus' initially constituted a 
major proportion of the total bacterial population in 
Brazil, this ratio has changed since 2004, and 'Ca. L. 
asiaticus' is now the most prevalent citrus-destroying 
species [4]. Both 'Ca. L. americanus' and 'Ca. L. asiati- 
cus' are transmitted by a psyllid vector, Diaphorina citri 
(also known as the Asian citrus psyllid, or ACP) in Asia, 
North America, and South America [7,8]. The HLB- 
associated Liberibacters can also be transmitted by graft- 
ing propagative material from infected plants onto nur- 
sery stock. The continued economic losses associated 
with HLB are a serious threat to the U.S. citrus industry 
[9]. HLB affects all citrus cultivars [10] and to date 
there are no known HLB-resistant citrus cultivars. 

The genetic structure within a given pathogen popula- 
tion can be a valuable resource for determining the 
source or origin of the pathogen and risk management of 
the disease. The availability of a genome sequence for 
'Ca. L. asiaticus' has provided information on the meta- 
bolic features of this bacterium and key insights into 
HLB pathogenesis [11]. In addition, the genome sequence 
has facilitated the development of DNA markers for 
genetic analysis; these molecular genetic markers are cri- 
tical for understanding the genetic diversity of global 
populations and the epidemiology of HLB. DNA markers 
have been used for characterization of microbial popula- 
tions associated with plant diseases, including RAPDs, 
SNPs, MLST and SSRs (microsatellites) [12-15]. Molecu- 
lar genetic markers not only aid in the general characteri- 
zation of a given population, but can help identify the 
source of an introduced pathogen [16]. 



Among the three HLB-associated Liberibacter species, 
'Ca. L. asiaticus' is the most widespread and is responsi- 
ble for increasing economic losses of citrus industries. 
Much attention has been drawn by researchers in the 
last few years to the importance of understanding the 
epidemiology and ecology of the disease associated with 
'Ca. L. asiaticus'. 'Ca. L. asiaticus' isolates were charac- 
terized in some previous studies; most of these studies 
focused on the Asian continent and utilized conserved 
genes as genetic markers. For example, southeast Asian 
isolates were characterized by sequencing the 16S rDNA 
and 16S/23S regions, omp, the rpl gene cluster, and the 
bacteriophage-type DNA polymerase [17]. The 16S 
rDNA was employed for understanding genetic diversity 
of 'Ca. L. asiaticus' in India [18] and a prophage gene 
was used to reveal variations in China [19]. However, 
genetic variation within conserved genes has limited dis- 
criminatory power to differentiate closely-related isolates 
within populations. 

Microsatellite DNA markers associated with hyper- 
variable sequence regions can provide sufficient resolu- 
tion for differentiating closely-related isolates and for 
tracking genotypes of interest; additionally, these mar- 
kers may help identify the source of invasive strains. 
Recently, similar types of markers have been used for 
differentiating 'Ca. L. asiaticus' in Japan [20]. Chen et 
al. [21] studied populations from Guangdong province 
in China and Florida in the United States. However, 
the single variable locus used in that study provided 
limited characterization of 'Ca. L. asiaticus' genetic 
diversity. 

Here, we report a panel of seven polymorphic micro- 
satellite markers for conducting genetic analyses of 'Ca. 
L. asiaticus' isolates from Asia (India, China, Cambodia, 
Vietnam, Thailand, Taiwan, and Japan), North America 
(Florida, USA) and South America (Sao Paulo, Brazil). 
The microsatellite profile for each isolate was compared 
with all members of the sample set to make predictions 
on the possible origin and dissemination of HLB in 
Florida. 

Results 

PCR amplification and characteristic of microsatellite loci 

A total of 287 'Ca. L. asiaticus' isolates obtained from 
HLB-affected plants across Asia and the Americas 
(Additional file 1 and Table 2) were successfully ampli- 
fied by all seven microsatellite primer pairs. The num- 
ber of alleles and haploid genetic diversity per locus 
ranged from 2 to 30, and 0.204 to 0.881, respectively 
(Table 1). In the clone-corrected data set, genotypic 
linkage disequilibrium was not detected by pairwise 
comparison of loci across the overall isolates (P > 
0.01). 
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Table 1 Characteristics of seven microsatellite markers developed from 'Candidatus Liberibacter asiaticus' 



SSR 
Markers 


Primer sequences (5' 3') 


Repeats 


Location in 
genome 


ORF 




Size range 

lop) 


No. of 
alleles 


H 


LasSSR-A-f 
LasSSR-A-r 


FAM-CGCCTACAGGAATTTCGTTACG 
TCTCATCTTGTFGCTTCGTrTATCC 


(TATTCTG) 8 


255477- 
255753 


adenosine 
deaminases 


50°C 


241-434 


30 


0.881 


LasSSR-B-f 
LasSSR-B-r 


VIC- 

ATCGCCTATAAATCCCTTTACTGATATGTTFCC 
TGGTAACGGAAGTGATAATAACTACAGCAATAAG 


(jrTAA) 6 


669257- 
669458 


hypothetical 
protein 


60°C 


1 96-206 


3 


0.216 


LasSSR-C-r 


\fiC-CC ATTnTTf, ATf, A ATT ACC 
GAATAGAAGAACCCTAAGC 


iTAGTIn 


666722- 
666947 


ui ujui iui lyuiuidjCj 


50°C 


208-254 


15 


0 61 3 


LasSSR-D-f 
LasSSR-D-r 


N E D-CGGTGTCGGTATCGGTATC ATTC 
CGAAGAAGAGACGGAGGTFAAGC 


CTFOs 


377678- 
377850 


hypothetical 
protein 


55°C 


158-174 


3 


0.391 


LasSSR-E-f 
LasSSR-E-r 


NED-GATCAGTAGTCTATCACCAC 

TACTGGAAACAAATGGAATAC 


(CTTGTGT) 5 


354424- 
354613 


transcriptional 
regulator 


50°C 


173-290 


17 


0.587 


LasSSR-F-f 
LasSSR-F-r 


FAM-TCGTCTTATCGTATATGACTGC 

TTCACTATTAAAGGATGAAGGC 


(TTTACATC) 

3 


520542- 
520307 


repair ATPase 


52°G 


227-235 


2 


0.204 


LasSSR-G-f 
LasSSR-G-r 


FAM-CGGGAGAAATTAAAGATGATGG 
CGCTGTTAATACATACTTACGG 


(TTGTTGGA) 

2 


998251- 
998403 


hypothetical 
protein 


53°C 


139-152 


2 


0.204 



T a , annealing temperature of the primer pairs; H, Haploid genetic diversity 

Each forward primer was labeled with FAM, NED, VIC fluorescent dyes at 5', respectively 



Genotype and genetic diversity 

A total of 117 genotypes (haplotypes) were identified 
(Additional file 1). Haplotypes identified within the sam- 
ple population were restricted to the boundaries of their 
country of origin. The genetic diversity observed in dif- 
ferent countries and locations are summarized in Table 
2. Isolates from China possessed the largest number of 
alleles (5.7 alleles per locus), followed by India (5.4 alleles 
per locus). Overall haploid genetic diversities were the 
highest in Asian countries, followed by Brazil. The hap- 
loid genetic diversity of the Florida (USA) isolates was 
lowest among all the geographic groupings (Table 2). 

Genetic structure 

A UPGMA clustering analysis identified three major 
groups of 'Ca. L. asiaticus' (Figure 1). Isolates from India 
were clustered in a distinct group (group 3). Most of the 
isolates from China and other Asian countries, and Brazil 
were generally grouped in group 1. While some isolates 
from Florida occurred in group 1, most isolates from 
Florida were clustered in group 2 (Figure 1). 

The STRUCTURE analysis based on Bayesian model- 
ing further assessed the genetic structure of 'Ca. L. asia- 
ticus'. This approach utilizes statistical methods to 
determine the relationships among the isolates without 
prior population information. In the analysis three dif- 
ferent clusters (K) were identified based on the ad hoc 
statistic AK [23] (Figure 2). The membership of each 
individual isolate obtained from STRUCTURE analysis, 
can be estimated as (q), the ancestry coefficient, which 
varies on a scale between 0-1.0, with 1.0 indicating full 
membership in a population. Individuals can be assigned 
to multiple clusters (with values of q summing to 1.0) 
indicating they are admixed. Individual samples with q > 



0.90 (ancestry coefficient) were considered as having sin- 
gle lineage and individuals with q < 0.90 were consid- 
ered as admixed lineages as followed by Williams et al. 
[24]. The result of STRUCTURE analysis is consistent 
with UPGMA in which isolates from India were grouped 
in a distinct cluster (Figure 2 in yellow). Brazilian and 
most east-southeast Asian isolates were clustered as a 
single lineage (q > 0.90) (Figure 2, red). Some isolates 
taken from central Florida (Polk, Pasco, and Lake Coun- 
ties) shared the same lineage with east-southeast Asian 
and Brazilian isolates (Figure 2, red). Most Florida iso- 
lates, however, grouped in a different cluster (Figure 2, 
green). Some admixed isolates (q <0.90) were found in 
Florida as well as in Baise and Nanning of Guangxi pro- 
vince in China, and in Cambodia. 

eBURST analysis with user-defined criteria (based on 
the analysis of haplotypes that shared identical genotypes 
for at least 5 of the 7 loci) predicted three founder haplo- 
types: haplotype-108 (Nanning, Guangxi province, 
China), haplotype-48 (Sao Paulo, Brazil) and haplotype- 
46 (Tirupati District, Andhra Pradesh, India) (Additiontal 
file 1 and Figure 3). The diagram generated by eBURST 
showed a primary network between haplotype-103 and 
107 (Collier County, Florida) and predicted founder hap- 
lotype in China. A primary network was also identified 
with haplotype-51 (Pasco County, Florida) and the sec- 
ond predicted founder haplotype in Brazil. Haplotype-46 
from Tirupati District, Andhra Pradesh, India) was pre- 
dicted to be the third founder and hypothesized to be the 
founder haplotype of 'Ca. L. asiaticus' in India. 

Discussion 

Characterization of worldwide and regional 'Ca. L. asia- 
ticus' populations from HLB-affected plants can 
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Table 2 Descriptive statistics and genetic diversity of 'Candidatus Liberibacter asiaticus' isolates across seven 
microsatellite loci in the samples obtained from nine different countries from Asia, North (Florida, USA) and South 
Americas (Sao Paulo, Brazil) 



Country 


Location 
ID 


Location Information 


Total number of 
individuals 


Number of 
individuals in 
clone corrected 
data 


Alleles per 
locus 


Haploid genetic 
diversity 


Brazil 


BRA 


Sao Paulo 


22 


14 


2.7 


0.313 


USA 


FL-A 


Charlotte County, Florida 


5 


A 


1.6 


0.161 




FL-B 


Collier County, Florida 


-16 


11 


2.1 


0.234 




FL-C 


DeSoto County, Florida 


30 


5 


1.7 


0.194 




FL-D 


Hardee County, Florida 


8 


5 


1.7 


0.160 




FL-E 


Hendry County, Florida 


13 


5 


1.6 


0.171 




FL-F 


Highlands County, Florida 


19 


6 


1.7 


0.119 




FL-G 


Indian River, County, Florida 


23 


7 


1.9 


0.175 




FL-H 


Martin County, Florida 


10 


7 


1.9 


0.175 




FL-I 


Okechobee County, Florida 


A 


2 


1.3 


0.143 




FL-J 


Polk County, Florida 


6 


4 


2.0 


0.304 




FL-K 


St. Lucie County, Florida 


6 


A 


I .A 


0.179 




FL-L 


Pasco County, Florida 


2 


2 


1.1 


0.071 




FL-M 


Manatee County, Florida 


2 


2 


1.3 


0.143 




FL-N 


Hillsborough County, Florida 


2 


2 


1.3 


0.071 




FL-0 


Lake County, Florida 


1 


1 


l.O 


0.000 




USA-Florida-overall 


177 


67 


3.6 


0.247 


CHINA 


CHN-A 


Baise, Guangxi Province 


3 


2 


1.1 


0.071 




CHN-B 


Guilin, Guangxi Province 


3 


3 


1.4 


0.190 




CHN-C 


Hezhou, Guangxi Province 


3 


2 


1.3 


0.143 




CHN-D 


Nanning, Guangxi province 


14 


14 


3.0 


0.276 




CHN-E 


Fuzhou, Fujian Province 






2 I 


0 320 




CHN-F 


Tangshan, Yunnan Province 




2 


1 3 


0 1 43 




CHN-G 


Gangzhoou, Jiangxi Province 






1 0 


0 000 




CHN-H 


Guangzhou, Guangdong Province 


] 


i 


1 0 


0 000 




CHN-I 


Hunan Province 






1 o 


0 000 




China-overall 


JO 


j 1 


c 7 
J./ 






CAM-A 


Pursat Province 


7 


f. 


2 A 


0 341 




CAM-B 


Battambang Province 


A 


4 


1.9 


0.304 




Cambodia-overall 


11 


10 


3.1 


0.423 


VIETNAM 


VIET 


Hung Yen Province, Hoa Binh 
Province, Hanoi 


3 


3 


1.9 


0.317 


THAN AND 


THAI 


Unknown 


] 


1 


1 0 


0 000 


TAIWAN 

1 Ml WHIN 


TIW 


Unknown 


i 
i 


] 


1 o 


0 000 


JAPAN 


JPN 


Unknown 


] 


] 


1 o 


0 000 


INDIA 


IND-A 


Anantapur District, Andhra Pradesh 


7 


1 


2.4 


0.297 




IND-B 


Chittoor District, Andhra Pradesh 


6 


6 


2.0 


0.254 




IND-C 


Kadapa District, Andhra Pradesh 


A 


4 


1.9 


0.250 




IN D-D 


Mahaboobnagar District, Andhra 
Pradesh 


3 


3 


1.4 


0.159 




IND-E 


Nalgonda District, Andhra Pradesh 


A 


A 


1.7 


0.196 




IND-F 


Prakasam District, Andhra Pradesh 


A 


3 


2.4 


0.540 




IND-G 


Tirupati District, Andhra Pradesh 


5 


5 


2.0 


0.274 




IND-H 


Kurnool District, Andhra Pradesh 


1 


1 


l.O 


0.000 




IND-I 


Ludhiana District, Punjab 


1 


1 


1.0 


0.000 




India-overall 


35 


34 


5.4 


0.360 



Allele per locus: average number of alleles per locus 

Clone corrected data (removed repeated genotypes within a population) 
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Sao Paulo, Braal 

Pasco C ountry- Florida -USA 

Lake County, Florida, USA 

rinsslun YunnanProvincs. China 

Thailand 

Guiiti, Guanexi Province, China 
Pursat Province. Cambodia 
Hezhou, Guancii Province, Chnia 
Gangdioou, Jiansxi Prolines, China 
Nattiiiif , Guan?\i proviiEe, China 
Guansdiou, Guanadons Province, China 
F'^ziicu Fujim? revues. China 



- Hunan Province. China 

- Charbtte County Florida, USA 
-HendryCounty Florida, USA 
-Eishands County, Fbriia, USA 

- ?.darto County Florida, USA 

E, Lucie County. Florida, USA 

- Hvlsborouih County, Florida, USA 

- Collier County, Florida, USA 

- Indian River. County. Florida. USA 

- De S oto C oun ty, Florida, US A 
-Hardee County Florida, USA 

- Okeehobee County Florida, USA 

- Manatee County. Florida. USA 
-Pott County Florida, USA 

- Batss, Gitaiurxi Province, China 

- Bittimb he Prcvirc:. C imb cdia 

- Anantapur District, Andhra Pradesh, India 



Group-1 (Brazil and, east- 
south east Asia dominant) 



- Group-2 (Florida, USA dominant) 



- Chi toot District Andhra Pradesh., India 

- Nateonda Dnuxt Andhra Pradesh, India 
-Prakasam District, And ht a Pradesh. Inda 
- Kadapa District, Andhra Pradesh, Inda 

- Malta bcobnasar District, Andhra Pradesh. India 

- Tirupati District Andhra Pradesh, India 

- Ludliim. District Punjab, Inda 
-Prakasam District, Andhra Pradesh, India 



Figure 1 UPGMA dendrogram showing the genetic relationships of 'Candidatus Liberibacter asiaticus' isolates from different locations 
within an individual country as well as from different countries (from Asia and Americas) Clone-corrected data were used for 
constructing the dendrogram based on DA distance [22]. Only bootstrap values > 25% are shown. 



facilitate identification of introduction patterns and pre- 
dict the possible relationships of HLB-associated Liberi- 
bacters found in different citrus growing regions. 
Multilocus microsatellite marker analysis can provide 
sufficient resolution for differentiating closely-related 
isolates and can be useful for tracking genotypes of 
interest; additionally, these markers may help identify 
the source of invasive strains. 

In this study, seven microsatellite markers successfully 
genotyped 'Ca. L. asiaticus' from global populations. 
Sequence analysis indicated that three of the microsatel- 
lites appear to overlap with microsatellites recently 
developed by Katoh et al. [20]. Various microsatellite 
length variations were found in 'Ca. L. asiaticus' from 
worldwide collections, with some loci having as many as 
30 alleles. 

Historical evidence reviewed by da Graca [25] sug- 
gested that HLB was observed in Guangdong province, 
China in the late 19th century [26], and later spread to 
other parts of the country. It is assumed that HLB may 
have been introduced into China from India along sea 
trade routes [27]. The first record of HLB-like symp- 
toms, referred to as 'dieback', was reported from India 
in the 18th century [28]; this was later suggested to be 
HLB [29]. As 'Ca. L. asiaticus' has been in Asian coun- 
tries over a century, the genetic diversity in Asian popu- 
lations was expected to be high, due to a longer period 
of mutation accumulation, population differentiation 
and natural selection. As hypothesized, a higher degree 
of genetic diversity for 'Ca. L. asiaticus' was observed in 



both China and India within the present study (Table 
2). In contrast, the lower level of allelic and haploid 
genetic diversity of 'Ca. L. asiaticus' in Florida and Brazil 
populations are consistent with the hypothesis that 'Ca. 
L. asiaticus' populations in these regions have been 
derived from recent introductions [30]. 

Human movement of infected plant materials is 
probably the main cause of long distance dissemination 
of both 'Ca. L. asiaticus'-positive psyllids and HLB- 
affected plant material. The distributions of haplotypes 
observed in 'Ca. L. asiaticus' in this study did not 
detect any identical haplotypes from different conti- 
nents or even from different countries within the same 
continent (Additional file 1). This result does not 
exclude the possibility of contemporary migration of 
'Ca. L. asiaticus' among different countries through the 
movement of infected plant materials or by the migra- 
tion of vector psyllids as rapid mutation and selection 
could lead to deviation of populations from their origi- 
nal sources. The vector, D. citri, has been in Brazil for 
over 60 years without any sign of HLB until its discov- 
ery on 2004 [4,25]. D. citri was discovered in Florida in 
Palm Beach, Broward and Martin counties in 1998 and 
has spread throughout the state since that time [7]. 
However, it is not clear when 'Ca. L. asiaticus' was 
introduced into Brazil and Florida. The rapid spread of 
HLB in Brazil and Florida after the first reports in 
2004 and 2005, respectively, suggests that HLB was 
most likely introduced around that time, rather than 
evolving from an indigenous source. 
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Sao Paolo, Brazil 

Charlotte County, Florida, USA 
Collier County, Florida, USA 

OeSoto County, Florida, USA 
Hard** County, Florida, USA 
Hon dry County. Florida, USA 
Highlands County. Florida, USA 

Indian Rlv«r, County. Florida. USA 

Martin County. Florida, USA 

Okoc hobee County, Florida, USA 
Polk County, Florida, USA 

St. Lucie County, Florida, USA 

Pasco County, Florida, USA 
Manatee County, Florida, USA 
Hillsborough County, Florida. USA 
Lake County, Florida, USA 
Balsa, Guangxl Province, China 



Guilln. Guangi 



I Province, Chin 
I Province, Chin 



Manning, Guangxl province, China 



Fuzhou, Fujian Province, China 

Tangshan, Yunnan Province, China 

Gangzhcipu, Jlapgxl Province, China 

GuanqzrYOu, GuangddTig Province, Cp ]na 
Hunan Province, China 

Pursot Province, Cambodia 

Battambang Province, Cambodia 
Vietnam 

Annnlopur District, Andhra Pradesh, India 

Chittoor District. Andhra Pradesh. India 

Kadnpn District. Andhra Pradesh, India 

(lohaboobnagar District, Andhra Pradesh, India 

Nolgonda District, Andhra Pradesh, India 

Prakasam District. Andhra Pradesh. India 

Tlrupatl District Andhra Pradesh, India 

Kurnool, District. Andhra Pradesh, India 
Ludhlana District. Punjab. India 




CAM- 1 
VIET 

m 



— Sao Paulo, Brazil 



FL-C 
FL-D 
FL-E 
FL-F 



FL-I 

FL-J 



FL-N 
FL-O 
CHN-A 

CHN-B 

CHN-C 



CHN-E 
CHN-F 

CAM-A 



IND-B 

IND-C 
I M D-D 

IND-E 
IND-F 
IND-G 



Florida, I SA 



Vietnam 

Thailand, Taiwan, Japan 



Figure 2 Individual assignments of 'Candidates Liberibacter asiaticus' isolates obtained from nine different countries from Asia and 
Americas by STRUCTURE analysis. There were three clusters (K). Black lines within the squares distinguish geographic locations. 



As suggested by the historical evidence and review of 
the early literature related to HLB, the most ancient 
population of 'Ca. L. asiaticus' perhaps originated in 
India. From the 20 th century onward, HLB spread 
through much of the citrus-growing regions of south 
and southeast Asia [2], the Arabian peninsula [31], East 
Timor and Papua New Guinea [32], and the western 
hemisphere (Brazil and the United States) [1]. It is diffi- 
cult to precisely know when the disease entered each 
country and from where it was introduced. Frequent 
shipment of plant materials and unlawfully importation 



of plants has increased the risk of disseminating exotic 
plant pathogens around the world. The exact pathways 
responsible for introducing HLB and the Asian citrus 
psyllid into the United States and Brazil have not yet 
been determined. 

The genetic relationships of the isolates in this study, 
as determined from the UPGMA based on Nei's genetic 
distance [22] and individual based clustering analysis by 
the STRUCTURE analyses, consistently identified three 
major genetic groups of 'Ca. L. asiaticus', with isolates 
from India included in a distinct genetic group (Figure 2 
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Associated diagram of eBURST analysis 



Network between Florida 
haplotypes and founder 
haplotypes m Brazil 

.36 




Network between Florida 
haplotypes and founder 
haplotypes in China 




Figure 3 Network diagram (based on nearly identical haplotypes that differed by two loci) from eBURST analysis. Solid blue circles in 
the diagram indicate three predicted founder haplotypes: China (Haplotye-108), Brazil (Haplotype-48) and India (Haplotype-46). A primary 
network was observed between haplotype-103 and 107 (Florida), and predicted founder haplotypes in China, and between haplotype-51 
(Florida) with predicted founder haplotypes in Brazil, suggesting two separate introductions of 'Ca. L. asiaticus' into Florida. 



and Figure 3). The similar genetic makeup amongst 
most isolates from east-southeast Asia and South Amer- 
ica (Sao Paulo, Brazil) support the hypothesis of the 
introduction of 'Ca. L. asiaticus' into South America 
from East Asia or Southeast Asia. While most isolates 
from Florida were clustered within a separate group, 
both UPGMA and STRUCTURE analyses showed that 
some isolates from central Florida overlapped with east- 
southeast Asian and Brazilian groups. The presence of 
two genetic groups in Florida suggests at least two 
introduction events are associated with the recent out- 
break of HLB in Florida. 

Based on the history of HLB, it could be predicted 
that populations of 'Ca. L. asiaticus' in Florida were 
most likely established following the introduction of 
HLB-affected plant materials or 'Ca. L. asiaticus'-carry- 
ing psyllid from Asia or other countries through 
human-mediated transport. The analyses in this study 
do not support the hypothesis of introduction of HLB 
into the Americas through biological materials sourced 
from India. Only a single isolate from India (Prakasam 
District, Andhra Pradesh) overlapped with the east- 
southeast Asian and Brazilian group (Figure 2, red). 
STRUCTURE analysis revealed that less-dominant clus- 
ters (Figure 2, red) in central Florida (Polk, Pasco, and 
Lake Counties) were observed in the same lineage (q > 
0.90) with east-southeast Asian and Brazilian clusters 
suggesting that the origin of members of this cluster in 
Florida might be derived from Asia or via Brazil. More- 
over, some admixed (q < 0. 90) isolates between Florida 
and east-southeast Asia also support the hypothesis of 
introducing 'Ca. L. asiaticus' into Florida from Asia. 

eBURST analysis provided further insights into the 
origin of 'Ca. L. asiaticus'. Founder haplotypes were 
identified from China, Brazil, and India. Based on their 
position within the eBURST network, these founders are 



predicted to have given rise to the three global genetic 
groups, consistent with prevailing theories of the geo- 
graphic origins of HLB [1,2,4,7]. While one founder type 
was predicted in Brazil, the similar genetic makeup of 
Brazilian and east-southeast Asian isolates suggest that 
this founder could have been introduced into Brazil 
from any of these Asian countries. Consistent with the 
STRUCTURE analysis, the eBURST diagram also pre- 
dicted the introduction of 'Ca. L. asiaticus' into Florida 
citrus groves through at least two separate introduction 
events. While a primary network was detected between 
a founder haplotype from China and two unique haplo- 
types in Florida, clear differentiation was observed 
between most isolates from China and Florida by Baye- 
sian clustering and UPGMA analyses. Differences 
between the dominant groups found in Florida and 
China were also reported in a recent study using a sin- 
gle VNTR locus [21]. It is uncertain whether the domi- 
nant group of Florida isolates were introduced en masse 
or if a small population of nearly-identical 'Ca. L. asiati- 
cus' haplotypes from China were introduced, evolved 
quickly, and established a large population. The recent 
discovery and rapid spread of HLB in Florida, along 
with wide distribution of dominant 'Ca. L. asiaticus' 
group observed in the present study suggests that iso- 
lates of this group have been directly introduced from 
an unknown location. Another recent study also indi- 
cated that some isolates of 'Ca. L. asiaticus' from Florida 
may have been introduced through two different events, 
and sources were unknown [21]. The analyses of micro- 
satellites in the present study, however, suggest that the 
introduction of the less-dominant cluster was likely 
from a single source either Asia or Brazil. The low 
occurrence of less dominant group in some central 
counties in Florida suggests that the members of this 
group were perhaps introduced more recently (Figure 
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4). However, it is certainly plausible that these two hap- 
lotypes were introduced into Florida at nearly the same 
time. Isolates from one of the sources may have spread 
quickly due to selective advantage under a favorable set 
of biological or environmental conditions. 

Our analysis showed that a dominant group of 'Ca. L. 
asiaticus' genotypes are widely distributed in south-cen- 
tral Florida (Figure 4). HLB is highly invasive and spatial 
spread of the disease is rapid compared to other arbor- 
eal pathosystems [4] . Increased diversity of host, patho- 
gen, vector, and environmental conditions likely 
influence the rates of HLB distribution. Moreover, the 
rates of HLB increase are directly related to increase 
and spread of the psyllid vector population: in June 
1998, the Asian citrus psyllid was first detected in Palm 
Beach County; within two years of this discovery the 
disease occurred to 31 counties in Florida [33]. The vec- 
tor is now present in nearly all citrus-growing areas of 
Florida [34]. In Florida, HLB was first discovered in 
Miami-Dade County in August 2005, seven years after 
detection of the vector in the same region [30]. By mid- 
October, the disease was found in many residential 
properties stretching northwards more than 250 km 
from Miami-Dade County to St. Lucie County and sev- 
eral commercial citrus groves were also affected in Palm 
Beach and Hendry Counties [1]. However, no epidemio- 
logical survey has clearly demonstrated HLB or 'Ca. L. 
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Figure 4 Sample distribution of 'Candidatus Liberibacter 
asiaticus' from 15 citrus-growing counties (gray highlighted) in 
Florida, USA. Green circles indicate the counties where only the 
dominant 'Co. L. asiaticus' group were observed based on 
STRUCTURE analysis (green in Figure 2). Some isolates from Polk 
County (13), Pasco County (14) and Lake County (15) were included 
with the genetic group 2 (less dominant group) (see Figure 2). 



asiaticus'-carrying psyllids being introduced into the 
southern part of Florida and then spreading northward 
through the continuous movement of psyllid vectors. 
Since 2005, HLB has spread to most citrus-producing 
counties in Florida [34,35]. The rapid and widespread 
distribution of this disease among citrus growing coun- 
ties in Florida is most likely due to the result of the 
multiple secondary introductions of HLB-associated 'Ca. 
L. asiaticus'. Based on the present analyses, it appears 
that there were at least two 'Ca. L. asiaticus' introduc- 
tion events in Florida. Moreover, the rapid distribution 
of HLB within Florida after 2005 is concomitant with 
the discovery of a dominant genetic cluster within 
south-central Florida. Taken together, this suggests that 
dominant 'Ca. L. asiaticus' haplotypes, possibly from dif- 
ferent countries may have established a population 
within Florida through multiple introduction events. 

Conclusions 

The seven microsatellites developed in this study are 
useful for detection, isolate differentiation, and genetic 
analysis of 'Ca. L. asiaticus'. Our results showed that 
current 'Ca. L. asiaticus' populations in HLB-affected 
citrus in Asia and the Americas are comprised of three 
distinct genetic groups: (1) Indian, (2) predominantly 
east-southeast Asian and South American (Brazil) and 
(3) predominantly North American (Florida, USA). 
While regional differences were observed from the dis- 
tribution of dominant clusters, the similar genetic 
makeup of east-southeast Asian and Brazilian isolates 
lead us to hypothesize that 'Ca. L. asiaticus' populations 
in Brazilian groves were most likely introduced from 
east or southeast Asia. The precise sources of the domi- 
nant genetic group of 'Ca. L. asiaticus' retrieved from 
Florida are not clearly resolved from the present analy- 
sis. However, less-pervasive groups may have been intro- 
duced directly from Asia or via Brazil. While the results 
here provide some insights into the origins and genetic 
structure of HLB-associated 'Ca. L. asiaticus', it should 
be noted that broader population analyses using a larger 
array of molecular markers will help resolved the ques- 
tions on the origin and dissemination of HLB-associated 
'Ca. L. asiaticus. 

Methods 

Sample collection/DNA extraction 

DNA from HLB-affected samples from Asia (India, 
China, Cambodia, Vietnam, Thailand, Taiwan, and 
Japan), North America (Florida, USA) and South Amer- 
ica (State of Sao Paulo, Brazil) were extracted from the 
respective sources and sent as microbially-sterile and 
non-infectious samples. HLB-associated Liberibacter-free 
DNA samples were used as negative controls. Basically, 
leaf samples were collected from citrus trees with 
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blotchy mottle and blotchy mottle-like symptoms. 
Leaves were washed under running tap water and 
blotted dry with paper towels. The midribs were then 
excised from the leaf blade. Total genomic DNA was 
extracted from 4-5 midribs per sample. Samples were 
ground in liquid nitrogen and DNA was extracted using 
the CTAB method. Precipitated DNA was dissolved in 
100 ul of TE buffer. The quality of DNA samples was 
checked by electrophoresis in 1.2% agarose gels. DNA 
samples were diluted 30 times with water for PCR. 

Microsatellite marker development 

To identify putative microsatellite regions in the 'Ca. L. 
asiaticus' genome, we used the program 'Tandem 
Repeats Finder' [36]. Following the identification of 
these regions, primers were designed (Eurofins-Operon) 
that flanked the prospective repeat sequence to generate 
a product of 150-400 base pairs. Over 100 primer sets 
were tested using multiple DNA samples obtained from 
HLB-affected plants from India, China, Brazil and Flor- 
ida. We postulated that polymorphisms, if present, 
should be observed within this pilot sample due to their 
geographic separation. Following amplification of 
regions containing putative microsatellite using the test 
primers, the products of each reaction were then run on 
5% of polyacrylamide gels. Silver staining was then used 
to visualize polymorphic alleles. This screening proce- 
dure identified seven loci with amplified sequence length 
variability. To facilitate high-throughput genotyping ana- 
lysis, each of seven forward primers was labelled with a 
fluorescent dye (Table 1). Amplified products were ana- 
lyzed by an ABI 3130 xl Genetic Analyser (Applied Bio- 
systems, Foster City, CA). 

PCR based genotyping 

PCR was performed in 20 ul containing 2 ul of lOx 
reaction buffer, 1.5 mM MgCl 2 , 0.2 mM dNTPs, 0.25 U 
AmpliTaq Gold (Applied Biosystems, Foster City, CA), 
2.5 pmole of each of SSR primer pairs and 2 ul of 
diluted DNA sample. PCR was conducted in the follow- 
ing conditions: 94°C for 4 minutes; 40 cycles consisting 
of 94°C for 45 seconds, annealing temperature (Table 1) 
for 45 seconds, and 72°C for 45 seconds; then a final 
extension at 72°C for 7 minutes. The successes of ampli- 
fications were checked running 5 ul of amplified pro- 
ducts in agarose gel electrophoresis using 2.5% agarose- 
TBE gels. For multiplex analysis, 2 ul of each amplified 
product labeled with different fluorescent dye was 
pooled. From 8 ul of pooled product, 2.5 ul was mixed 
with 0.25 ul of GeneScan-500 Liz molecular size stan- 
dard (Applied Biosystems Cat #4322682A) and 7.25 ul 
of Hi-Di Formamide (Applied Biosystems Cat. 
#4311320). The mixture of products was then loaded 
onto a Genetic Analyzer (Applied Biosystems, Foster 



City, CA) equipped with the 36 cm 16-capillary array 
filled with POP-7 polymer (Applied Biosystems, Foster 
City, CA). Data acquisition and fragment size determi- 
nations were carried out by GeneMapper v4.0 software 
(Applied Biosystems, Foster City, CA). 

Genotypes and genetic diversity analysis 

Genotypes were identified based on combination of alle- 
lic data from multiloucs microsatellite loci. A clone-cor- 
rected (removing repeated genotypes within a 
population) data set was built and used for the analysis 
of genetic diversity, linkage disequilibrium and genetic 
structure. GenAlEx Version 6.3 [37] was used to calcu- 
late the average number of alleles (Na) and haploid 
genetic diversity (H) at each locus as well as across all 
loci for each of the populations. 

Linkage disequilibrium analysis 

A global test (Fisher's method) implemented in GENE- 
POP web version 4.0.10 [38] was used to test for the 
genotyping linkage disequilibrium (LD) between all pair 
of loci across all samples in this study. 

Genetic structure analysis 

To determine the genetic relationships of 'Ca. L. asiati- 
cus'isolates, a UPGMA dendrogram was constructed 
based on Nei's genetic distance [22]. The trees were cal- 
culated using POPULATION software package Version 
1.2.31 (Olivier Langella, CNRS UPR9034, France found 
at web: http://bioinformatics.org/~tryphon/populations/) 
and graphically displayed with MEGA4 software [39]. 
Confidence in specific clusters of the resulting topology 
was estimated by bootstrap analysis with 1,000 
replicates. 

The program STRUCTURE 2.3.1 [40] was also used 
for a clustering algorithm based on a Bayesian model to 
assign individual isolate of 'Ca. L. asiaticus' to a speci- 
fied number of clusters. This algorithm assumes a 
model in which there are K clusters (where K may be 
unknown), each of which is characterized by a set of 
allele frequencies at each locus. No linkage disequili- 
brium was detected between all pairs of loci across all 
samples with the clonal corrected data set. Therefore, 
the program STRUCTURE 2.3.1 [40] was rationally used 
to estimate the number of clusters (K) within 'Ca. L. 
asiaticus' where 10 independent runs of K = 1-10 were 
performed without any prior information as to the ori- 
gin (location) of individual samples. For each run, a 
burn-in period of 25,000 iterations was used followed by 
a run length of 50,000 Markov chain Monte Carlo itera- 
tions, and a model with correlated allele frequencies and 
admixture among populations. The model was run with 
10 independent simulations for each K. The number of 
clusters that best represents the observed data was 
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determined by maximizing the estimated In likelihood of 
the data for different values of K, and the AK index 
which is based on the rate of change in the In likelihood 
of the data between successive K(l-10) [23]. The opti- 
mal probabilities for all individuals were estimated from 
10 replicate runs at K = 3 with permutation analysis 
using CLUMPP version 1.1.2 [41], and the output of 
genetic clustering was visualized using software DIS- 
TRUCT version 1.1 [42]. 

To provide further insight into the relationships 
among 'Ca. L. asiaticus', the eBURST v3 program http:// 
eburst.mlst.net/ was employed to identify putative foun- 
der types. For this analysis, user- defined group defini- 
tion was set to include those haplotypes that shared 
identical genotypes for at least 5 of the 7 loci. The mini- 
mum single-locus variant count for subgroup definition 
was set to 3. 

Additional material 



Additional file 1: Sample and haplotype information for all isolates 
used in this study 
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